An Algebraic Topological Approach to Privacy: Numerical and Categorical Data

Authors

  • Alberto Speranzon
  • Shaunak Dattaprasad Bopardikar
Abstract

In this paper, we cast the classic problem of achieving k-anonymity for a given database as a problem in algebraic topology. Using techniques from this field of mathematics, we propose a framework for k-anonymity that brings new insights and algorithms for anonymizing a database. We begin by addressing the simpler case in which the data lies in a metric space. This case is instrumental in introducing the main ideas and notation. Specifically, by mapping a database to Euclidean space and by considering the distance between datapoints, we introduce a simplicial representation of the data and show how concepts from algebraic topology, such as the nerve complex and persistent homology, can be applied to efficiently obtain the entire spectrum of k-anonymity of the database for various values of k and levels of generalization. For this representation, we provide an analytic characterization of the conditions under which a given representation of the dataset is k-anonymous. We introduce a weighted barcode diagram which, in this context, becomes a computational tool to trade off data anonymity against data loss, expressed as the level of generalization. Simulation results illustrate the main idea of the paper. We conclude with a discussion of how to extend this method to the general case of a mix of categorical and metric data.
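
The idea of reading off k-anonymity across levels of generalization can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes, as a simplification, that each record is generalized to a closed ball of a given radius, that records whose balls overlap (directly or through a chain) form one group, and that the anonymity level is the size of the smallest group. This tracks only connected components (degree-0 homology of the nerve of the balls); the function names `component_sizes`, `anonymity_level`, and `anonymity_spectrum` are hypothetical.

```python
from itertools import combinations
from math import dist


def component_sizes(points, radius):
    """Sizes of the groups formed when balls of `radius` around the points overlap."""
    parent = list(range(len(points)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(points)), 2):
        # Two closed balls of equal radius intersect iff
        # their centres are at most 2 * radius apart.
        if dist(points[i], points[j]) <= 2 * radius:
            parent[find(i)] = find(j)

    sizes = {}
    for i in range(len(points)):
        root = find(i)
        sizes[root] = sizes.get(root, 0) + 1
    return sorted(sizes.values())


def anonymity_level(points, radius):
    """Assumed reading: the largest k such that every group has at least k records."""
    return min(component_sizes(points, radius))


def anonymity_spectrum(points, radii):
    """Anonymity level for each generalization radius in `radii`."""
    return [(r, anonymity_level(points, r)) for r in radii]


if __name__ == "__main__":
    records = [(0, 0), (1, 0), (10, 0), (11, 0), (12, 0)]
    for r, k in anonymity_spectrum(records, [0.1, 0.6, 5.0]):
        print(f"radius={r}: k={k}")
```

Sweeping the radius shows the trade-off the abstract describes: a larger radius (more generalization, more data loss) merges groups and raises k, while a smaller radius preserves detail at the cost of anonymity.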

Similar resources

Functorial semantics of topological theories

Following the categorical approach to universal algebra through algebraic theories, proposed by F. W. Lawvere in his PhD thesis, this paper aims at introducing a similar setting for general topology. The cornerstone of the new framework is the notion of categorically-algebraic (catalg) topological theory, whose models induce a category of topological structures. We introduce t...

ON ALGEBRAIC AND COALGEBRAIC CATEGORIES OF VARIETY-BASED TOPOLOGICAL SYSTEMS

Motivated by the recent study on categorical properties of lattice-valued topology, the paper considers a generalization of the notion of topological system introduced by S. Vickers, providing an algebraic and a coalgebraic category of the new structures. As a result, the nature of the category TopSys of S. Vickers gets clarified, and a metatheorem is stated, claiming that (latticevalu...

Categorically-algebraic topology and its applications

This paper introduces a new approach to topology, based on category theory and universal algebra, and called categorically-algebraic (catalg) topology. It incorporates the most important settings of lattice-valued topology, including poslat topology of S. E. Rodabaugh, $(L,M)$-fuzzy topology of T. Kubiak and A. Šostak, and $M$-fuzzy topology on $L$-fuzzy sets of C. Guido. Moreover, its respe...

Bernoulli collocation method with residual correction for solving integral-algebraic equations

The principal aim of this paper is to obtain the numerical solution of an integral-algebraic equation (IAE) by using the Bernoulli polynomials and the residual correction method. After implementation of our scheme, the main problem is transformed into a system of algebraic equations whose solutions are the unknown Bernoulli coefficients. This method gives an analytic solution when ...

Preserving Micro Data Release: Categorical and Numerical Data

Data mining techniques, in spite of their benefit in a wide range of applications, have also raised threats to privacy and data security. All the attributes in a database table can be classified into three categories: identifying attributes, sensitive attributes, and quasi-identifier attributes. K-anonymity is a popular approach for privacy-preserving data mining and the problems with Kanonymi...

Journal title:
  • CoRR

Volume abs/1602.06643

Publication date 2016